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Abstract 

Phosphoeno/pyruvate carboxylase (PEPC) catalyses the initial fixation of atmospheric C0 2 into oxaloacetate and sub- 
sequently malate. Nocturnal accumulation of malic acid within the vacuole of photosynthetic cells is a typical feature 
of plants that perform crassulacean acid metabolism (CAM). PEPC is a ubiquitous plant enzyme encoded by a small 
gene family, and each member encodes an isoform with specialized function. CAM-specific PEPC isoforms probably 
evolved from ancestral non-photosynthetic isoforms by gene duplication events and subsequent acquisition of tran- 
scriptional control elements that mediate increased leaf-specific or photosynthetic-tissue-specif ic mRNA expression. 
To understand the patterns of functional diversification related to the expression of CAM, ppc gene families and pho- 
tosynthetic patterns were characterized in 11 closely related orchid species from the subtribe Oncidiinae with a range 
of photosynthetic pathways from C 3 photosynthesis (Oncidium cheirophorum, Oncidium maduroi, Rossioglossum 
krameri, and Oncidium sotoanum) to weak CAM [Oncidium panamense, Oncidium sphacelatum, Gomesa flexuosa 
and Rossioglossum insleayi) and strong CAM (Rossioglossum ampliatum, Trichocentrum nanum, and Trichocentrum 
carthagenense). Phylogenetic analysis revealed the existence of two main ppc lineages in flowering plants, two main 
ppc lineages within the eudicots, and three ppc lineages within the Orchidaceae. Our results indicate that ppc gene 
family expansion within the Orchidaceae is likely to be the result of gene duplication events followed by adaptive 
sequence divergence. CAM-associated PEPC isoforms in the Orchidaceae probably evolved from several independ- 
ent origins. 

Keywords: Crassulacean acid metabolism, gene duplication, Orchidaceae, Oncidiinae, phosphoeno/pyruvate carboxylase, 
photosynthesis. 



Introduction 

Crassulacean acid metabolism (CAM) is one of three modes 
of photosynthesis found in vascular plants for the assimi- 
lation of atmospheric C0 2 . CAM differs from C 3 and C 4 
photosynthesis in that CAM plants take up C0 2 with low 
water expenditure at night when evaporative demand is low 
(Winter and Smith, 1996ft; Cushman, 2001). CAM is phy- 
logenetically widespread across 343 genera and 35 plant 



families comprising more than 6% of flowering plant spe- 
cies (Griffiths, 1989; Smith and Winter, 1996; Holtum et al, 
2007; Silvera et al, 2010a). The multiple independent origins 
of CAM and the functional convergence of CAM traits in 
the many lineages in which it occurs suggest that the start- 
ing point for CAM evolution might have required relatively 
few genetic changes. However, this notion may be simplistic, 



Abbreviations: AH + , titratable acidity; CAM, crassulacean acid metabolism; GSR gene-specific primer; PEPC, phosphoeno/pyruvate carboxylase; RACE, rapid 
amplification of cDNA ends; RT-PCR, reverse transcription-PCR. 
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given the complexity of the CAM pathway and the fact that 
several biochemical and anatomical requirements, and regu- 
latory changes associated with gene expression patterns, are 
all tightly coordinated in CAM plants (Cushman et al. , 2008; 
Silvera et al, 2010a). To gain insight into the evolutionary 
history of genes recruited for CAM function, the study of 
taxa containing many closely related species with contrast- 
ing photosynthetic pathways is experimentally helpful. In 
this context, we use tropical orchids as a study group, because 
CAM is widespread among epiphytes within this large family 
of vascular plants (Winter and Smith, 1996a; Silvera et al, 
2009, 2010a, b), and because species within the Orchidaceae 
exhibit a gradient of photosynthetic pathways ranging from 
C 3 photosynthesis to weak- and strong-CAM modes (Silvera 
et al, 2005, 2010a). Weak-CAM species show low-level 
CAM activity and typically obtain 5% or less of their car- 
bon through the CAM pathway under well-watered condi- 
tions. Weak CAM appears to be common among neotropical 
orchid species (Silvera et al. 2005). 

In vascular plants, phosphoeno/pyruvate carboxylase 
(PEPC; EC 4.1.1.31) belongs to a multigene family, and 
each member encodes an enzyme with specialized functions 
(Gehrig et al, 1995, 1998, 2001; Chollet a/., 1996; Izuietal, 
2004; O'Leary et al, 2011). In species performing C 4 photo- 
synthesis and CAM, one or more ppc genes encode isoforms 
of the enzyme that catalyse the fixation of atmospheric C0 2 
into C 4 -dicarboxylic acids. In CAM plants, CAM-specific 
isoforms catalyse the nocturnal, irreversible (3-carboxylation 
of phosphoeno/pyruvate in the presence of HC0 3 and Mg 2+ 
yielding oxaloacetate and inorganic phosphate. Oxaloacetate 
is then converted to malate, which is stored as malic acid 
in the vacuole. During the subsequent day, malic acid is 
decarboxylated, resulting in C0 2 release and refutation by 
ribulose-l,5-bisphosphate carboxylase/oxygenase. This 
C0 2 -concentrating mechanism, or 'C0 2 pump', suppresses 
photorespiration and improves water-use efficiency relative 
to that of C 3 and C 4 plant species (Cushman and Bohnert, 
1997). Cytosolic and chloroplastic PEPC enzymes are found 
in photosynthetic organisms from higher plants and green 
algae to cyanobacteria and photosynthetic bacteria, and 
also in non-photosynthetic bacteria and protozoa (Chollet 
et al, 1996; Izui et al, 2004). In all plants, 'housekeeping' or 
non-photosynthetic isoforms of PEPC catalyse anapleurotic 
reactions to replenish biosynthetic precursors for the Krebs 
cycle. PEPC has many other physiological roles in plants 
that include maintaining cellular pH, supplying carbon to 
N 2 -fixing legume root nodules, absorbing and transport- 
ing cations in roots, control of stomatal movements, fruit 
maturation, and seed germination (Latzko and Kelly, 1983; 
Chollet et al, 1996; Echevarria and Vidal, 2003; Izui et al, 
2004; O'Leary et al, 2011; Singh et al, 2012). While PEPC is 
mainly a cytosolic enzyme, a plastid-localized version of the 
enzyme, rice Osppc4, has been described and is involved in 
providing organic acids for ammonium assimilation in leaves 
(Masumoto et al, 2010). 

The currently available molecular data support the view 
that none of the C 4 or CAM enzymes are unique to C 4 
or CAM plants (Westhoff and Gowik, 2004; Gowik and 



Westhoff, 2011), suggesting that these ubiquitous and func- 
tionally diverse isoforms served as starting points for the 
evolution of the C 4 and CAM genes (Cushman and Bohnert, 
1999; Monson, 1999). Furthermore, evidence from PEPC 
and comparative analysis of the C 4 -cycle enzymes in C 3 , 
C 3 -C 4 intermediates, and C 4 species in the genus Flaveria 
suggests that key amino acid residue changes are responsi- 
ble for their acquisition of distinct kinetic and regulatory 
properties (Biasing et al, 2000, 2002; Westhoff and Gowik, 
2004). Similarly, phylogenetic analysis indicates that multiple 
origins of C 4 photosynthesis in grasses and sedges are the 
likely result of recurring selection acting on a few amino acid 
positions of the PEPC enzyme within and across taxonomic 
scales (Christin et al, 2007, 20126; Besnard et al, 2009). 
Phylogenetic analyses also indicate that there was a single 
PEPC origin before the divergence of bacteria and plant 
lineages (Chollet et al, 1996; Izui et al, 2004; Westhoff and 
Gowik, 2004). CAM-specific PEPC isoforms are thought to 
have first evolved in response to water deficit from ancestral 
non-photosynthetic isoforms by gene duplication, followed 
by acquisition of transcriptional control sequences that medi- 
ate leaf- or photosynthetic-tissue-specific increases in mRNA 
expression (Gehrig et al, 2001, 2005; Taybi et al, 2004). For 
example, seven distinct PEPC isoforms were recovered in the 
CAM species Kalanchoe pinnata (Lam.) Pers.: four isoforms 
from leaves and three from roots (Gehrig et al, 2005). In 
Mesembryanthemum crystallinum L., a CAM-specific isoform 
was expressed during the induction of CAM, in addition to 
an uninduced housekeeping isoform (Cushman and Bohnert, 
1999). Based on comparative studies of PEPC in many plant 
taxa including orchids performing CAM, Gehrig et al (2001) 
predicted the clustering of PEPC isoforms according to their 
taxonomic position and specific function. 

Previously, full-length ppc genes were characterized from 
bacteria, several vascular plant species, cyanobacteria, and 
protozoa (Izui et al, 2004). The purpose of our study was to 
reconstruct the evolutionary history of PEPC in Orchidaceae. 
We have characterized the diversity of ppc genes in a phylo- 
genetic context, using partial-length sequences from a closely 
related group of orchid species in the Oncidiinae that express 
photosynthetic pathways ranging from C 3 photosynthesis 
to weak CAM and strong CAM. The results highlight the 
evolutionary diversification of ppc gene families and indicate 
the possible role of gene duplication and recruitment of ppc 
genes for CAM. 



Materials and methods 

Oncidiinae species and characterization 

Eleven closely related species within the Oncidiinae with a range of 
photosynthetic pathways from C 3 photosynthesis [Oncidium chei- 
rophorum Rchb. f., Oncidium maduroi Dressier, Oncidium sotoanum 
R. Jimenez & Hagsater, and Rossioglossum krameri (Rchb. f.) M.W. 
Chase & N.H. Williams] to weak CAM [Gomesa flexuosa (G.Lodd.) 
M.W. Chase & N.H. Williams, Oncidium panamense Schltr., Oncidium 
sphacelation Lindl., and Rossioglossum insleayi (Barker ex Lindl.) 
Garay & G.C. Kenn.] and strong CAM [Rossioglossum ampliatum 
(Lindl.) M.W. Chase & N.H. Williams, Trichocentrum carthagenense 
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(Jacq.) M.W. Chase & N.H. Williams, and Trichocentrum nanum 
(Lindl.) M.W. Chase & N.H. Williams] were chosen as a study group 
for genetic studies of CAM based on carbon isotopic composition 
and titratable acidity measurements (Table 1; Silvera et al, 2005). 
Oncidiinae represents one of the most highly diverse clades of 
orchids from the neotropics, with variation in chromosome number, 
vegetative features, photosynthetic mechanisms, and floral charac- 
teristics (Chase et al. , 2005). A phylogenetic reconstruction for these 
1 1 species was performed using a matrix that included sequences of 
nuclear ribosomal DNA internal transcribed spacer 1 (nrITS-1 and 
-2), plastid DNA regions ycfl (~1200bp portion from the 5' end, 
and ~1500bp portion from 3' end), matK, and the trnH-psbA inter- 
genic spacer, as previously described (Neubig et al, 2012) in each 
species, except for O. maduroi, which included only data for nrITS-1 
and -2 and ycfl . 



infrared gas analyser (Li-Cor, Lincoln, NE, USA) operating in the 
absolute mode for up to 5 d. More than one individual per species 
was measured. The temperature inside the cuvette and chamber was 
25 °C during the day and 22 °C during the night under an ambient 
C0 2 concentration with a dew point of 1 8 °C and a light intensity of 
300 umol m 2 s _1 during the 12 h light period. 

Titratable acidity determinations 

Titratable acidity (AH + ) measurements were conducted as described 
previously and are presented as the difference between the 
meanistandard deviation of three replicate samples at dawn and 
dusk (Silvera et al, 2005). Nocturnal acid accumulation reflects the 
magnitude of dark C0 2 fixation but does not provide information 
on the uptake of atmospheric C0 2 via C 3 photosynthesis in the light. 



Gas-exchange measurements 

Photosynthetic gas exchange was measured on attached, mature 
leaves of plants from each of the 11 orchid species targeted for 
this study. Measurements of 24 h C0 2 exchange demonstrate the 
proportions of C0 2 fixed in the light and dark, respectively, and 
thus allow conclusions about the degree to which plants engage 
in CAM relative to C 3 photosynthesis. Leaves were sealed inside a 
Plexiglass® cuvette located within a controlled-environment cham- 
ber (Environmental Growth Chambers, Chagrin Falls, Ohio, USA). 
The hole in the cuvette through which the leaf was inserted was 
sealed with a non-porous synthetic rubber sealant (Terostat VII; 
Henkel-Teroson, Heidelberg, Germany). Prior to measurements, 
plants were well watered, fertilized once per week with a commercial 
20:20:20 and/or 16:32:16 (N:P:K) fertilizer solution, and maintained 
in an open greenhouse. The diel temperature range within the green- 
house varied from a minimum of 20 °C to a maximum of 32.2 °C, 
and relative humidity varied from 80 to 100%. Daily light availabil- 
ity ranged from 5 to 70% of full sunlight and corresponded roughly 
to the natural light exposure of these species in the field. Plants 
were kept in their growth medium, which consisted of lightweight 
pumice aggregates and coarse synthetic sponge, to avoid dehydra- 
tion before experiments. Due to the small size of T. nanum leaves, 
an entire plant of this species, including roots, was placed inside the 
cuvette. Net C0 2 exchange was measured continuously using a flow- 
through gas-exchange system (Walz, Effeltrich, Germany) operat- 
ing at an airflow rate of 1.3 1 mm 1 and monitored with a LI-6252 



Plant material 

Leaf samples for ppc gene analysis were obtained from mature leaves 
of mature orchids for 10 of the 11 species targeted for this study 
(O. cheirophorum was excluded from the gene analysis due to insuf- 
ficient leaf material). Plants were grown in closed greenhouses at 
the University of Nevada, Reno, and the University of California, 
Riverside. Daily temperatures within the greenhouse varied from 17 
to 35 °C, relative humidity varied from 40 to 80%, and the mean 
photon flux density was 200 umol mr 2 s . Plants were watered 
daily and nutrients were supplied once per week with a combina- 
tion of slow-release fertilizer (Osmocote® 19-6-12 formula; Scotts 
Company, Marysville, OH, USA) and commercial fertilizer solution 
(Schultz® 19-31-17 formula; Spectrum Brands, Madison, WI, USA). 
Healthy leaves from each of 10 species were collected at 2 p.m. and 
2 a.m. to account for putative circadian differences in the relative 
expression abundance of PEPC mRNAs. Root samples were also 
collected at the same times from mature plants and stored separately. 
All samples were flash frozen in liquid nitrogen immediately after 
harvesting, and stored at -80 °C until isolation of total RNA. 

RNA extraction 

Total RNA was extracted using an RNeasy Midi kit (Qiagen, 
Valencia, CA, USA) with a modified polyethylene glycol RNA 
extraction method including high-molecular-weight polyethylene 
glycol (Gehrig et al. , 2000), which has proven successful for RNA 



Table 1 . Values of b 13 C, leaf thickness, and titratable acidity for 1 1 species from the Oncidiinae 

Titratable acidity (AH + ) is represented as the difference between the mean+standard deviation of three replicates at morning and evening (Silvera 
et al., 2005, and this study). Species are listed in order based on AH + from C 3 photosynthesis to weak CAM and strong CAM. FW, fresh weight; 
NS, not significant. 



Species name 


Leaf 5 13 C (%«) 


Leaf 

thickness (mm) 


H + (evening) 
(umol H + g" 1 FW) 


H + (morning) 
(umol H + g" 1 FW) 


AH + 


Photosynthetic 
pathway 


Oncidium sotoanum 


-25.2 


0.25 


2.9 + 0.4 


2.7 + 1.6 


-0.2 NS 


c 3 


Oncidium cheirophorum 


-27.4 


0.36 


29.9 + 3.3 


30.2 + 2.7 


0.3 NS 


c 3 


Rossioglossum krameri 


-31.7 


0.35 


14.4 + 2.5 


14.8 + 2.4 


0.4 NS 


c 3 


Oncidium maduroi 


-24.7 


0.24 


17.3 + 1.9 


19.5 + 0.8 


2.2 NS 


c 3 


Rossioglossum insleayi 


-22.5 


1.10 


16.3 + 0.5 


34.9+12 


18.60= 


Weak CAM 


Oncidium panamense 


-26.2 


0.54 


11.5 + 0.7 


33.2 + 0.3 


21.7" 


Weak CAM 


Oncidium sphacelatum 


-27.9 


0.53 


8.3 + 6.3 


31.2 + 2.1 


22.9 a 


Weak CAM 


Gomesa flexuosa 


-24.4 


0.26 


37.9 + 9.4 


74.0+10.4 


36. 1 a 


Weak CAM 


Trichocentrum nanum 


-17.2 


3.40 


18.8 + 4.1 


57.3 + 5.2 


38.50° 


Strong CAM 


Trichocentrum carthagenense 


-12.2 


2.32 


12.5 + 0.4 


77.3 + 3.4 


64.8 a 


Strong CAM 


Rossioglossum ampliatum 


- 15.3 


1.59 


5.5 + 1.3 


153.5 + 3.6 


148.0 a 


Strong CAM 



3 Denotes significance between means of the morning and evening at P<0.05 as determined by Student's f-test. NS, not significant. 
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isolation from succulent and non-succulent orchid tissues. RNA 
integrity was examined by 1% agarose gel electrophoresis, and 
RNA quality and concentration were examined using a NanoDrop 
ND-1000 UV-V spectrophotometer (NanoDrop Technologies, 
Rockland, DE, USA). 

Reverse transcription (RT)-PCR amplification and cloning 

An 11 00 bp fragment was amplified by RT-PCR using two 
degenerate primers, as defined previously for PEPC (Taybi 
et at, 2004; Gehrig et al, 2005; Kore-eda et al, 2005) and modi- 
fied slightly for orchid specificity. The degenerate primers were: 
S'-TCNGAYTCNGGVAARGAYGC-S' (forward) and reverse 
5'-GCDGCRATRCCYTTCATKG-3' (reverse). Using a One-Step 
RT-PCR kit (Qiagen), 500 ng of total RNA was reverse transcribed 
and amplified following the manufacturer's instructions. Final con- 
centrations of the reaction components were 400 uM for each dNTP, 
lx Qiagen One-Step RT-PCR Buffer containing 12.5mM MgCl 2 , 
2 ul of Qiagen One-Step RT-PCR Enzyme Mix, and 1 uM PEPC 
forward and reverse primers. The following temperature cycling con- 
ditions were used: reverse transcription at 50 °C for 30min, initial 
PCR activation at 95 °C for 15min, amplification for 39 cycles at 
94 °C for 1 min, 40 °C for 1 min, and 72 °C for 1 min, and final exten- 
sion at 72 °C for 10 min. RT-PCR was used to amplify a fragment 
of 11 00 bp encompassing the C-terminal third of the PEPC cod- 
ing region. By using this partial sequence, distinct isoforms could be 
distinguished without the need to isolate the full 3000 bp sequence, 
because this fragment was variable enough to allow differentiation 
among isoforms. The sequence also has a highly conserved active 
site that facilitates correct alignment and identification (Gehrig 
et al. ,2001; Izui et al. , 2004) (Supplementary Fig. S2 at JXB online)" 
In this regard, the use of partial rather than full-length PEPC cDNA 
sequences has proven useful for molecular phylogenetic and taxo- 
nomic comparisons across species, thus saving time and financial 
resources for researchers interested in using PEPC as a molecular 
marker (Gehrig et al, 2001). The RT-PCR products were purified 
by agarose gel electrophoresis, recovered using a QIAquick Gel 
Extraction kit (Qiagen), cloned into the TA-TOPO cloning vector 
pCR2.1 vector system (Invitrogen™ Life Technologies, Carlsbad, 
CA, USA), and transformed into XLl-Blue or TOP10 competent 
Escherichia coli cells following the manufacturer's instructions. 
Bacterial cells containing plasmids from 100-150 randomly selected 
clones per orchid species were grown in Terrific Broth liquid medium 
for 16 h at 37 °C with vigorous shaking. The bacterial cells were then 
harvested by centrifugation at 13 000g, and the plasmids were puri- 
fied using a Qiagen Plasmid Mini kit (Qiagen) following the manu- 
facturer's instructions. Selected cDNA clones were then analysed by 
EcoRl restriction enzyme digestion and electrophoresis on 1% aga- 
rose gels stained with 0.5 ug ml -1 of ethidium bromide. Selected plas- 
mids were sequenced at the Nevada Genomics Center, University of 
Nevada, Reno, with a ABI BigDye™ Terminator Cycle Sequencing 
Ready Reaction kit, v3.1, and an ABI 3730 Sequence Analyzer 
(Applied Biosystems, Life Technologies, Grand Island, NY, USA), 
using the M13 forward (5'-TGTAAAACGACGGCCAGT-3') and 
reverse (5'-GAGCGGATAACAATTTCACACAG-3') primer sets. 
Over 1200 cDNA clones were sampled and sequenced from 10 of 
the 1 1 closely related Oncidiinae species targeted for this study, and 
1000 cDNAs were selected and identified as PEPC isoforms (100 
clones per species). 



PEPC sequence analysis 

Raw sequences were edited manually by removing vector sequences 
using Mac Vector v. 11.1 software (Mac Vector, Cary, NC, USA). 
Forward and reverse PEPC fragments were then assembled in 
Mac Vector. Over 1200 assembled sequences were verified by iden- 
tifying conserved amino acid sequences using the Basic Local 
Alignment Search Tool (BLAST) within the non-redundant data- 
base at the National Center for Biotechnology Information, and 



were translated into the corresponding amino acid sequences. 
Multiple sequence alignments for both nucleotide and protein 
sequences were used to visually identify distinct PEPC isoforms 
within each species separately. Sequences that were identical within 
a species were considered the same isoform. PEPC isoforms present 
in each species were then named based on their relative abundance, 
so that ppcl-ol (letter 'o' for Oncidiinae) would correspond to the 
most abundantly transcribed isoform recovered by clone sampling 
in each species, followed by ppcl-o2,ppcl-o3,ppcl-o4,ppcl-o5, and 
ppcl-o6, respectively. By naming isoforms in this fashion, we could 
easily identify the isoforms that are abundantly expressed in leaves 
and roots when perfoming phylogenetic analysis. All isoforms were 
then renamed based on their position in the PEPC phylogenetic 
analysis (ppcl-Ml-ol, ppcl-Ml-o2, ppcl-M2-ol , ppc-M2-o2, and 
so forth). 



Rapid amplification of cDNA ends (RACE) amplification 

The 3' ends of the PEPC cDNA fragments for three species 
(O. maduroi, O. panamense, and R. ampliatuni) were recovered using 
the 3' RACE system (SMART™ RACE cDNA Amplification, 
BD Bioscience Clontech, Mountain View, USA) following the 
manufacturer's instructions and using gene-specific primers (GSPs) 
based on sequences obtained by the initial degenerate RT-PCR 
(Supplementary Table SI at JXB online). RACE and larger ampli- 
fied cDNA products were then sequenced using a BigDye™ 
Terminator Sequencing kit and an ABI 3730 Sequence Analyzer in 
the Nevada Genomics Center at the University of Nevada, Reno. 
Sequences were edited manually for base-call inaccuracies, and 
vector sequences were removed using Mac Vector v. 1 1 . 1 software. 
Sequences were then assembled into the 11 00 bp isoform fragments 
for each species using the assembly project function in MacVector. 
To identify sequencing errors or chimaeras that were potentially 
formed during RACE, GSPs were also designed to confirm iso- 
form identities (Supplementary Table S2 at JXB online). We only 
performed RACE amplification in three of the 10 species for which 
tissue was abundantly available. 



Phylogenetic analyses of ppc 

Multiple sequence alignments that included 39 PEPC isoforms 
from the study species and 273 ppc sequences available in GenBank 
and Phytozome (Goodstein et al., 2012) were obtained using the 
MUSCLE function (Edgar, 2004) in MEGA v.5.2.2 (Tamura et al., 
2011) and refined manually. The ppc sequences downloaded from 
GenBank included those from several orchid genera with CAM 
species, such as Angraecum, Dendrobium, Epidendrum, Leptotes, 
Microcoelia, Phalaenopsis, Solenagis, Taeniophyllum, and Vanilla. 
The ppc gene sequences from 63 plant genera and a sequence from 
the alga Chara fragilis Desv. (Supplementary Table S3 at JXB 
online) were also included in the alignment. Because phyloge- 
netic trees based on ppc gene sequences from broad phylogenetic 
sampling have been shown consistently to produce two distinct 
lineages that are highly homologous but that diverged before the 
evolution of land plants (ppc-1 and ppc-2), we focused our analy- 
sis only on sequences from ppc-1, which contain all of the CAM 
and C 4 -specific ppc genes (Gowik and Westhoff, 2011; Christin 
et al., 2014). By using only ppc-1 genes, we avoided ambiguities in 
the alignment files that could then be reflected in the phylogenetic 
analysis (Christin et al, 2014). Phylogenetic trees were inferred 
with MrBayes v.3.2.1 (Ronquist and Huelsenbeck, 2003) following 
a general time-reversible model of nucleotide substitution with a 
y-shaped parameter and a proportion of invariants (GTR+G+I). 
Two analyses were performed in parallel; each was composed of 
16 chains, run for 20 000 000 generations, sampling a tree every 
1000th generation after a burn-in period of 7 000 000 generations. 
A consensus tree was computed after the burn-in period. The con- 
vergence and appropriateness of the burn-in period were checked 
with Tracer (Rambaut and Drummond, 2007). 
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Results 

Photosynthetic patterns and relationship of 1 1 
Oncidiinae species 

Among the 1 1 Oncidiinae species used in this study, the genus 
Rossioglossum contained C 3 , weak-CAM, and strong-CAM 
species. Oncidium was represented by both weak-CAM and 
C 3 species, whereas Gomesa was represented by one weak- 
CAM species and Trichocentrum contained two strong- 
CAM species (Fig. 1). The phylogenetic relationship of the 
11 Oncidiinae species used in this study (Fig. 1) followed the 
nomenclature proposed by Neubig et al. (2012) and is con- 
sistent with the phylogenetic relationships within Oncidiinae 
sensu Chase inferred from 590 species (Neubig et al, 2012). 
O. maduroi, O. sotoanum, O. cheirophorum, and R. krameri 
showed no nocturnal net C0 2 uptake and therefore were 



considered C 3 species (Fig. 2). G. flexuosa, O. panamense, 
O. sphacelation, and R. insleayi clearly exhibited nocturnal 
net C0 2 uptake, but its contribution to total carbon gain was 
small compared with C0 2 uptake during the light period, so 
they were classified as weak-CAM species (Fig. 3). In these 
weak-CAM species, the majority of the C0 2 was taken up 
during the day followed by a small amount of C0 2 loss dur- 
ing the beginning of the night and limited net C0 2 uptake 
throughout the night. R. ampliatum, T. carthagenense, and 
T. nanum were considered strong-CAM species based on 
their pronounced net C0 2 uptake during the night (Fig. 4). 
T. nanum (a strong-CAM species; Fig. 4, bottom panel) 
showed daytime net C0 2 uptake during the afternoon hours. 
However, the majority of its net C0 2 uptake occurred dur- 
ing the middle of the night. Root respiration for T. nanum 
was included because the whole plant was measured within 
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Fig. 1 . Phylogenetic relationship of 1 1 species from the Oncidiinae with photosynthetic pathways ranging from strong CAM (red shading) to weak 
CAM (yellow shading), and C 3 photosynthesis (green shading). The phylogeny was reconstructed using sequences of nrlTS- 1 and -2, plastid DNA ycfl , 
matk, and the trnti-psbA intergeneric spacer. Eulophia graminea was used as the outgroup. Values on the branches represent bootstrap support (%). 
Accession numbers next to each species correspond to the voucher specimens deposited at the University of Florida Herbarium (FLAS). Representative 
images of corresponding floral morphology are shown to the right of each species designator. Photos by K. Silvera. 
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Fig. 2. Continuous net C0 2 exchange by C 3 photosynthesis orchid 
species during a 1 2 h light (open bar)/1 2 h dark (filled bar) cycle period. 



Fig. 3. Continuous net C0 2 exchange by weak-CAM orchid species 
during a 1 2 h light (open bar)/12h dark (filled bar) cycle period. 



the cuvette due its small size. Compared with the other two 
strong-CAM species for which only leaf-gas exchange was 
determined, the presence of root respiration in the T. nanum 
experiment resulted in considerable C0 2 loss during the early 
light and dark periods. Nonetheless, the strong-CAM charac- 
ter of the 24 h C0 2 gas-exchange pattern in this species was 
evident. 

Identification of multiple PEPC isoforms within 
Oncidiinae species 

Transcripts of between two and six isoforms of PEPC were 
sufficiently abundant to be recovered from each of the 10 
species studied, based on our sampling strategy. Species per- 
forming C 3 photosynthesis expressed two to three isoforms, 
whereas species with weak CAM expressed three to four iso- 
forms, and species with strong CAM expressed four to six iso- 
forms (Table 2). Multiple nucleotide sequence alignment of 
the 1100 bp PEPC fragments from 39 isoforms recovered in 
this study and 273 sequences downloaded from GenBank and 
Phytozome showed 162 informative sites, 192 sites without 
gaps, and 186 variable sites. The ppc-1 consensus tree gener- 
ated from nucleotide sequence information from 312 species 



revealed two main lineages in flowering plants (eudicots and 
monocots; Fig. 5). The ppc sequences from the algal species 
C. fragilis, and those present in Bryophyta (Physcomitrella) 
and Lycopodiophyta (Selaginella), were sister to the ppc 
sequences from gymnosperms and angiosperms. The ppc lin- 
eage composed of Pyrrosia, a CAM fern (Fig. 5), and the 
aquatic CAM genus Isoetes was basal to the angiosperm and 
gymnosperm ppc lineages (Fig. 5). 

The eudicot lineage was monophyletic and composed of 
two well-supported clades (ppcl-El and ppcl-E2; Fig. 5 and 
Supplementary Fig. SI). The position of these clades was con- 
sistent with those described by Christin and Besnard (2009) 
and Christin et al. (2014). Gene duplication events that led 
to the six gene lineages of ppc in C 4 grasses are shown within 
the monocot lineages (namely ppc-aLla, ppc-aLlb, ppc-aL2, 
ppc-aR, ppc-Bl, and ppc-B2, Figs 5 and 6) and follow the 
nomenclature proposed by Christin and Besnard (2009). The 
monocot lineage in the current study appeared to be para- 
phyletic because some monocot Orchidaceae ppc sequences 
were embedded within the eudicots ppcl-El lineage (Fig. 5, 
represented by the grey clade). 

Gene duplication events led to three ppc lineages in the 
Orchidaceae: two well-supported lineages within the monocot 
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Fig. 4. Continuous net C0 2 exchange by strong-CAM orchid species 
during a 1 2 h light (open bar)/12h dark (filled bar) cycle period. 

lineage (ppcl-Ml and ppcl-M2; Fig. 6), and a third line- 
age, which is embedded within the eudicot ppcl-El lineage 
(Fig. 5). The monocot lineage contained all of the 39 tran- 
scribed isoforms recovered in this study {ppcl-Ml and ppcl- 
Ml; Fig. 6). The most abundantly transcribed isoform of all 
the Oncidiinae species recovered in this study grouped within 
the ppcl-Ml lineage (Fig. 6, ppcl-Ml-ol represented in red, 
orange, and green). The most abundantly transcribed iso- 
forms for weak-CAM species (e.g. G.flexuosa ppcl-Ml-ol and 
ppcl-Ml-o2, O. sphacelatum ppcl-Ml-ol and ppcl-Ml-o2, 
O. panamense ppcl-Ml-ol and ppcl-Ml-o2, and R. insleayi 
ppcl-Ml-ol and ppcl-Ml-o2, represented in orange in Fig. 6) 
clustered together with the most abundantly transcribed 
isoforms for strong-CAM species (e.g. R. ampliatum ppcl- 
Ml-ol and ppcl-Ml-o2, T. carthagenense ppcl-Ml-ol and 
ppcl-Ml-o2, and T. nanum ppcl-Ml-ol, represented in red 
in Fig. 6). The most abundantly transcribed isoform for C 3 
species (e.g. O. maduroi ppcl-Ml-ol , R. krameri ppcl-Ml-ol , 
and O. sotoanum ppcl-Ml-ol, represented in green in Fig. 6) 
also clustered with the most abundantly transcribed isoforms 
for weak- and strong-CAM species. The phylogenetic similar- 
ity between the two most abundantly transcribed isoforms for 
weak- and strong-CAM species {ppcl-Ml-ol and ppc-Ml-o2) 
and the most abundantly transcribed isoform for C 3 species 
{ppcl-Ml-ol) suggested that these orthologous isoforms have 
similar expression patterns and abundance and are prob- 
ably involved in the same functions. The ppc sequences from 
Oncidiinae genera such as Gomesa, Oncidium, Rossioglossum, 
and Trichocentrum tended to separate into two distinct groups 



{ppcl-Ml and ppcl-M2), and this pattern was also consist- 
ent within orchid genera in which several isoforms were 
reported such as Vanilla, Leptotes, and Dendrohium (Fig. 6). 
Within the monocot lineage, Orchidaceae ppc sequences fol- 
lowed the species relationships in both Ml and M2 clusters. 
For example, ppcl-Ml was composed of ppc sequences from 
closely related Oncidiinae species (species within Gomesa, 
Oncidium, Rosioglossum, and Trichocentrum; Fig. 6), and 
ppcl-Ml was sister to ppc sequences from other species 
in the subfamily Epidendroideae (species within subtribes 
Laeliinae, Aeridinae, Angraecinae, Aerangidinae; Fig. 6). 
The ppc sequences from the subfamily Epidendroideae were 
sister to ppc sequences from Vanilla (subfamily Vanilloideae) 
and T. nanum ppcl-Ml-o5 (Fig. 6). A gene duplication event 
probably occurred before the diversification of Oncidiinae 
ppcl-Ml and ppcl-M2 (Fig. 6, represented by a red filled cir- 
cle). At least 10 gene duplication events were apparent within 
Oncidiinae ppc lineages and were related to ppc gene expan- 
sion in weak-CAM and strong-CAM species (Fig. 6, repre- 
sented by magenta filled circles in ppcl-Ml and ppcl-M2). 
Eight gene duplication events occurred within Oncidiinae 
ppcl-Ml, four of which were linked to gene family expansion 
in lineages leading to the weak-CAM species O. sphacelatum, 
O. panamense, and R. insleayi, and four were linked to gene 
family expansion in lineages leading to the strong-CAM spe- 
cies R. ampliatum, T. carthagenense, and T. nanum (Fig. 6). 
Within Orchidaceae ppcl-M2, two gene duplication events 
were linked to gene family expansion, one of which occurred 
in a lineage leading to R. ampliatum, and in another leading 
to T. nanum (Fig. 6, represented by filled circles in ppcl-M2). 



Discussion 

The Oncidiinae is one of the most diverse clades within the 
Orchidaceae, with a wide range of contrasting characteristics 
such as vegetative morphology, floral variation, chromosome 
number, and pollination systems (Neubig et al, 2012). Our 
results indicate that Oncidiinae also shows contrasting pho- 
tosynthetic types, as demonstrated by gas exchange, titratable 
acidity, leaf thickness, and isotopic composition measure- 
ments (Table 1, Figs 2-4). Most species within the Oncidiinae 
are epiphytes in habitats with intermittent water availability, 
and many exhibit CAM photosynthesis (Silvera et al, 2009). 
The degree of CAM expression in orchid species corre- 
lates with leaf thickness or succulence (Winter et al, 1983), 
reduced intercellular air spaces, and large mesophyll cell size 
(Nelson et al, 2005), with a minimum necessary cell vol- 
ume per unit leaf area for nocturnal acid storage. Therefore, 
thin-leaved, weak-CAM species are predicted to exhibit a 
limited degree of nocturnal net C0 2 uptake (Fig. 3) when 
compared with thick-leaved, strong-CAM species (Fig. 4). 
Indeed, weak-CAM species in the Oncidiinae showed pat- 
terns of nocturnal C0 2 uptake typical of CAM species but at 
a greatly reduced magnitude. Although their 5 13 C values were 
within the C 3 range, these species exhibited statistically sig- 
nificant differences between evening and morning titratable 
acidity, which is indicative of weak CAM. Species with 6 13 C 
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Table 2. PEPC isoform counts from 10 species from Oncidiinae based on relative abundance of clone sampling 
Four additional isoforms were recovered using GSP by RACE amplification. 



Species name 


Isogene 


Relative abundance (%) 


Functional designation 


Oncidium maduroi (C 3 ) 


ppc1-M1-o1 


88 


c 3 




ppc1-M2-o2 


11 


C 3 




ppc1-M1-o3 


1 


c 3 


Rossioglossum krameri (C 3 ) 


ppc1-M1-o1 


92 


c 3 




ppc1-M2-o2 


8 


c 3 


Oncidium sotoanum (C 3 ) 


ppc1-M1-o1 


86 


c 3 




ppc1-M2-o2 


10 


c 3 




ppc1-M1-o3 


4 


c 3 


Gomesa flexuosa (weak CAM) 


ppc1-M1-o1 


66 


CAM 




ppc1-M1-o2 


31 


c 3 




ppc1-Ml-o3 


2 


c 3 




ppc1-M2-o4 


1 


c 3 


Oncidium sphacelatum (weak CAM) 


ppd-Ml-o1 


96 


CAM 




ppc1-Ml-o2 


2.5 


c 3 




ppc1-Ml-o3 


1 


c 3 


Oncidium panamense (weak CAM) 


ppc1-M1-o1 


73 


CAM 




ppc1-M1-o2 


19 


c 3 




ppc1-M2-o3 


7.5 


c 3 




ppc1-M1-o4 


by RACE 


c 3 


Rossioglossum insleayi (weak CAM) 


ppc1-M1-o1 


76 


CAM 




ppc1-M1-o2 


11 


c 3 




ppc1-M2-o3 


10 


c 3 




ppc1-M1-o4 


2 


C 3 


Rossioglossum ampliatum (CAM) 


ppc1-M1-o1 


70 


CAM 




ppc1-Ml-o2 


20 


C 3 /CAM 




ppc1-M2-o3 


5 


c 3 




ppc1-M2-o4 


5 


C 3 




ppc1-M1-o5 


by RACE 


c 3 




ppc1-M2-o6 


by RACE 


c 3 


Trichocentrum carthagenense (CAM) 


ppd-Ml-o1 


84 


CAM 




ppc1-Ml-o2 


9 


C 3 /CAM 




ppc1-M2-o3 


4 


c 3 




ppc1-M1-o4 


3 


C 3 




ppd-M1-o5 


by RACE 


c 3 


Trichocentrum nanum (CAM) 


ppc1-Ml-o1 


59 


CAM 




ppc1-M2-o2 


20 


C 3 /CAM 




ppc1-M1-o3 


16 


c 3 




ppc1-M2-o4 


4 


c 3 




ppc1-M1-o5 


1 


c 3 



values characteristic of C 3 species can obtain up to one-third 
of their carbon via the CAM pathway (Winter and Holtum, 
2002). In contrast, gas-exchange measurements for the thin- 
leaved C 3 photosynthetic species O. maduroi, O. sotoanum, 
O. cheirophorum, and R. krameri showed net C0 2 uptake 
exclusively during the daytime and a slight C0 2 loss during 
the night (Fig. 2). Strong-CAM species with thick leaves such 
as R. ampliatum, T. carthagenense, and T. nanum exhibited 
most of their carbon gain at night (Fig. 4). 

Differences in PEPC isoform numbers and relative mRNA 
abundance using 10 closely related Oncidiinae species were 
associated with the capacity to perform CAM, as measured by 
24 h net C0 2 gas exchange. Based on our RT-PCR approach, 
as many as six isoforms were observed in weak- and strong- 
CAM species, and one to two putative CAM-specific PEPC 



isoforms were identified based on their relative abundance 
(Table 2) and position in the phylogenetic tree (Fig. 6, orthol- 
ogous ppcl-Ml-ol and ppc-Ml-o2 sequences represented in 
red and orange). The increase in number of PEPC isoforms 
associated with weak- and strong-CAM species could be 
the result of a single gene duplication event, gene duplica- 
tion due to polyploidy, or different alleles of the same gene. 
Oncidiinae CAM species have a significantly higher DNA 
content compared with weak-CAM and C 3 species (J.C. 
Cushman, unpublished data). A possible explanation for the 
expansion of the ppc gene family is that repeated genome 
duplication events leading to polyploidy in orchids have cre- 
ated an increased pool of duplicated genes and alleles suit- 
able for CAM. Therefore, the presence of multiple genomes 
might confer an advantage for adaptive evolution (Hegarty 
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Fig. 5. Phylogenetic relationship of 312 ppc- 7 gene sequences with emphasis on the Orchidaceae (shaded in grey). The phylogenetic tree was 
obtained through Bayesian analysis. Taxonomic groups are compressed based on plant orders, with the size of the triangle proportional to the number 
of sequences present in each clade. Orders are represented in bold, and the main ppc lineages are depicted on the right. Two main ppc lineages are 
highlighted in the tree (eudicots and monocots). The filled circle represents the duplication event before the split between eudicots and monocots. The 
asterisk represents a sequence of Cycas revoluta nested within a Kalanchoe clade (Saxifragales). Detailed information is available in Supplementary Fig. 
S1 at JXB online. Bar, expected substitutions per site. 
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and Hiscock, 2007). Genomic information is needed to verify 
the number of isoforms present in each species and the sub- 
sequent orthology or paralogy of the ppc sequences found in 
this study, and to identify missing gene lineages that could 
not be recovered with our PCR-based sampling approach. 
Even so, the present study suggests that the ppc gene fam- 
ily in Oncidiinae orchids has probably undergone gene fam- 
ily expansion during the evolutionary establishment of weak 
and strong CAM as evidenced by at least eight gene duplica- 
tion events revealed by the phylogenetic tree analysis. 

The identification of two distinct ppc clades in flowering 
plants indicates that these two lineages evolved independently 
of each other, and that this event occurred early in the diversi- 
fication of ppc lineages (Christin and Besnard, 2009; Christin 
et al. , 20126). Within the Oncidiinae, a gene duplication event 
occurred early in the diversification of ppcl-Ml and ppcl-M2 
lineages. Several ppcl-Ml genes, presumably derived from 
gene duplication events (Fig. 6, represented by filled circles), 
evolved a role in CAM photosynthesis (Fig. 6, represented 
by red and orange sequences), whereas genes within ppcl-M2 
maintained anapleurotic roles. This phylogenetic grouping of 
PEPC isoforms into distinct sister clades [ppcl-Ml and ppcl- 
Ml) was also evident for other subtribes within the subfamily 
Epidendroideae (e.g. subtribe Laeliinae, subtribe Aeridinae, 
and subtribe Dendrobiinae; Fig. 6) and for the subfamily 
Vanilloideae PEPC isoforms. 

Distinct Oncidiinae PEPC isoforms associated exclusively 
with root tissue were not identified in the current study. Gehrig 
et al. (2005) found three root isoforms of PEPC, each of which 
contained an insertion of 8 aa towards the C-terminal end of 
the enzyme in K. pinnata, a strong-CAM species. Putative root 
PEPC isoforms from the orchid species studied here showed 
isoforms that were identical to those found in leaves, and 
no insertions were evident. Because aerial roots in epiphytic 
orchids can engage in C 3 photosynthesis (except for leafless 
orchids in which roots perform CAM; Winter et al, 1985), 
the ppc genes found in roots of epiphytic orchids are most 
likely the same as those found in leaves (Kwok-ki et al. , 1983). 
Interestingly, T. nanum ppcl-Ml-o5 (Fig. 6) was quite distinct 
from all other isoforms in the Oncidiinae ppc sequences, and 
was positioned close to Vanilla, a distantly related orchid 
group (Fig. 6). This very-low-abundance isoform might be 
difficult to recover, which might explain why it was not previ- 
ously reported in closely related species. The many low-abun- 
dance isoforms (ppcl-o3 to ppcl-o6) found in this study might 
be the result of functional diversification of paralogous genes 
involved in non-photosynthetic PEPC functions (i.e. house- 
keeping or anapleurotic functions). The finding of a third ppc 
Orchidaceae lineage composed of three genera (Microcoelis, 
Leptotes, and Solenangis) outside the monocots and nested 
within the eudicots, and a Cycas revoluta ppc sequence nested 
within Kalanchoe sequences, is puzzling. Further investiga- 
tion is needed to rule out the possibility that these sequences 
are the result of contamination or horizontal gene transfer. 
Additional quantitative and temporal expression analysis of 
mRNAs for all isoforms in this study is needed to confirm 
their putative functional contributions to CAM. Similarly, 
more genomic and transcriptomic data from orchid species 



are needed to confirm and identify ppc lineages in orchid 
genomes, and to recover isoforms that were missed due to 
potentially incomplete PCR-based sampling. 

The PEPC isoforms most abundantly transcribed in C 3 
Oncidiinae species clustered closely with those of strong- 
CAM and weak-CAM species (Fig. 6, Oncidiinae species 
sequences highlighted in colors, ppcl-Ml), suggesting that 
there is no specific convergence of amino acid changes or 
selective pressure towards amino acid changes linked to CAM 
function, as there is in C 4 species (Christin et al, 20126). The 
most abundantly transcribed isoforms in cDNAs isolated 
from leaf and root photosynthetic tissues of C 3 , weak-CAM, 
and strong-CAM Oncidiinae species were orthologous, sug- 
gesting that these isoforms might be involved in similar func- 
tions and that they have a role in nocturnal C0 2 uptake in 
species with weakly and strongly expressed CAM. All of the 
other Orchidaceae ppc sequences available from GenBank 
and used in this study belonged to strong-CAM species. 
There were no ppc sequences available from C 3 orchid rela- 
tives, making it impossible to test whether ppc sequences from 
potentially closely related C 3 species within the subfamilies 
Epidendroideae and Vanilloideae would have clustered in the 
same manner as those from strong-CAM species. This defi- 
ciency highlights the utility of conducting gene family surveys 
with closely related species with contrasting photosynthetic 
pathways to elucidate the molecular genetic underpinnings of 
photosynthetic pathway evolution. 

Several scenarios could explain the diversification of 
PEPC isoforms. In one scenario, a ppc gene duplication event 
occurred early in the diversification of plants producing two 
clades: one clade with several duplicated sequences, one of 
which underwent recruitment for CAM through neofunc- 
tionalization, while the other clade contained sequences that 
retained the ancestral function. In general, neofunctionaliza- 
tions require changes in gene expression (Brautigam et al, 
201 1; Gowik and Westhoff, 201 1) and/or amino acid substitu- 
tions to confer an entirely new function (Zhang, 2003). A sec- 
ond scenario involves a change in regulation. In this scenario, 
a gene duplication event from a ppc ancestral gene with dual 
functionality underwent regulatory changes that determined 
whether it would perform in CAM or C 3 photosynthesis. This 
implies that neofunctionalization does not need to be linked 
to changes in amino acid positions, and that C 3 paralogous 
genes can be recruited for CAM function through subfunc- 
tionalization, in which one of the duplicated genes becomes 
better at performing one of the functions of the progenitor 
genes (Hughes, 1999; Zhang, 2003). Perhaps the most likely 
scenario in the evolution of PEPC in orchids is the former, as 
suggested by the close clustering of ppc sequences from C 3 , 
weak-CAM, and strong-CAM species that exhibited greater 
transcript abundance (Fig. 6, orthologous ppcl-Ml genes, 
represented in red, orange, and green) relative to the other 
gene family members. This scenario suggests that increased 
transcript abundance occurs prior to the transition to the 
CAM phenotype. Also, the recurrent independent origin of 
CAM in distantly related plant clades (Keeley and Rundel, 
2003), provides evidence that the evolution of CAM probably 
involves the use and modification of genes that are already 
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present in the C 3 ancestors of these species. Alternatively, 
cryptic genetic variants present in common ancestral popula- 
tions could come to be expressed, and subsequently increase 
in frequency, in multiple descendent lineages under similar 
selection regimes (Barrett and Schluter, 2007; West-Eberhard 
et al, 2011). This hypothesis envisions the presence of unex- 
pressed PEPC alleles suitable for CAM and/or C 4 photosyn- 
thesis in the genome of ancestral distantly related C 3 species 
for millions of years, which can then be incorporated into 
CAM or C 4 species through increased expression of either 
formerly cryptic or universally adaptive genes. These alleles 
could then become fixed through natural selection in popula- 
tions in which they are suited to a new photosynthetic mode 
(West-Eberhard et al, 2011; Christin et al, 2012a). There is 
increasing evidence for the importance of cryptic and stand- 
ing genetic variation in evolution (Gibson and Dworkin, 2004; 
Barrett and Schluter, 2007; McGuigan and Sgro, 2009) and 
no special mechanism is required for such variation to explain 
the convergent use of ancestral alleles in a new context. 

Within the Orchidaceae, the presence of CAM is evolution- 
arily labile and prone to parallel evolution and reversal events 
especially within clades that contain large numbers of epi- 
phytic species (subfamily Epidendroideae; Silvera et al. , 2009, 
2010a). The association of CAM with semi-arid or arid envi- 
ronments and microhabitats, or other stressful conditions, 
suggests a role for environmental influences in its recurrent 
origin and evolution (West-Eberhard, 2003). This sugges- 
tion is also supported by the observation that the extent of 
CAM expression often correlates with the degree of special- 
ized adaptations to more xeric ecological niches (Kluge et al. , 
2001; Pierce et al, 2002; Zotz, 2004). The recurrent evolution 
of CAM reflects strong selection under conditions in which 
CAM might afford an advantage, as in the epiphytic habitat 
of orchids. Advantageous biochemical shifts and molecular 
genetic rearrangements could modulate changes in mRNA 
or protein expression patterns from C 3 to CAM, and imply 
a direct role for environmental cues in allowing selection to 
act on variation in underlying genotypes. Also, structural pre- 
cursors such as enlarged vacuoles and tight cell packing may 
need to be present for the evolutionary progression from C 3 
to CAM to occur (Sage, 2002). After CAM becomes estab- 
lished within a lineage, the expression of CAM can be plastic 
or environmentally sensitive, as illustrated by the existence 
of 'facultative', 'inducible', or 'optional' CAM species that 
engage in CAM in response to environmental stimuli such 
as water-deficit stress (Winter, 1985; Griffiths, 1988; Winter 
et al, 2008; Winter and Holtum, 2014). This hypothesis is 
also supported by the observation that the weak-CAM spe- 
cies contain increased numbers of gene duplication events 
compared with the C 3 species, and that these species have 
added the novel capacity for net dark C0 2 fixation to their 
continued capacity to express mostly C 3 photosynthesis. This 
flexibility, combined with the ubiquity of enzymes required to 
perform CAM, might explain why multiple independent ori- 
gins of CAM, as well as reversals, have been observed within 
the Orchidaceae (Silvera et al , 2009). 

In summary, several lines of evidence presented here sug- 
gest that the evolution of CAM ppc genes in orchids involved 



gene duplication coupled with the recruitment of specific gene 
family member for photosynthetic pathway- specific func- 
tions. First, using our cloning approach, increased numbers of 
expressed PEPC isoforms were detected as sampling proceeded 
from C 3 photosynthesis to weak-CAM and strong-CAM spe- 
cies within closely related species of the Oncidiinae. Secondly, 
phylogenetic analysis revealed that ppc genes with the great- 
est relative transcript abundance from C 3 , weak-CAM, and 
strong-CAM species grouped together. This observation sug- 
gests that the identified increases in PEPC mRNA expression 
typical of CAM-specific isogenes were acquired before the spe- 
cies diverged and might indicate parallel rather than conver- 
gent evolutionary tracks for these specific gene lineages. 

The current Oncidiinae ppc dataset lays a strong founda- 
tion for future comparisons of gene lineages expressed in dif- 
ferent tissues of C 3 , weak-CAM, and strong-CAM species. 
However, more comprehensive transcriptomic and genomic 
datasets based on deep-sequencing methods are required to 
identify potentially missing gene lineages and possible new 
PEPC isoforms. For example, gene family members with low- 
abundance transcripts identified in the current study might 
exhibit higher transcript abundance in other species, leading 
to a more detailed understanding of lineage-specific gene 
recruitment patterns among diverse CAM species. In addi- 
tion to conducting detailed sampling of all possible tissues 
types, including leaf, pseudobulb, and root tissues, future 
works should include the sampling of these tissues over more 
detailed time courses in order to identify gene family mem- 
bers and lineages that have acquired pronounced diel or circa- 
dian mRNA expression patterns, which are likely to be useful 
diagnostic indicators for gene recruitment to CAM-specific 
function (Silvera et al, 2010a). In conclusion, this study pro- 
vides clear evidence for the roles of gene duplication and neo- 
functionalization within ppc gene lineages in the evolutionary 
progression of CAM within the Oncidiinae. 

Supplementary data 

Supplementary data is available at JXB online. 

Supplementary Table SI. Gene-specific primer sets used 
for 3' RACE amplification of the cDNA fragment coding for 
PEPC isoform for three species from Oncidiinae. 

Supplementary Table S2. Gene-specific primer sets used 
for confirmation of the cDNA fragment recovered by RACE 
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